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DESCRIPTION 

DXDE0X7 DYE TERHIlilATORS 

FTFITiT? 9^ ™^ TUVENTION 
This invention relates to dye terminator nucleic 
acid sequencing and reagents for such sequencing* 

pfirynROUND OF THK INVENTION 
5 The following is a discussion of the relevant art, 

none of which is admitted to be prior art to the 
appended claims. 

Sequence reaction products must be labeled. This 
can be done xising labeled primers, labeled nucleotides 
10 (usually radioactive dNTPs) or labeled ddNTP 

terminators. The use of labeled terminators has the 
advantage of leaving false-stops undetectable. 

DNA sequence bands do not necessarily have xmiform 
intensities. It is txseful to e3q>i:ess band intensity 
15 variability numerically. This can be done by reporting 
the ratio of maximum to minimum intensity of nearby 
bands (within a window of perhaps 40 bases) in a DNA 
sequence or, with normalization and correction for 
systematic "drift" in intensity by reporting the root 
20 mean square of band intensities (typiccd-ly peak 

heights) (Fuller, C.W., Comments 16(3) :l-8, 1989). It is 
advantageoiis to have uniformity of band intensity as 
sequence accuracy and read-length is improved with bands 
of more imiform intensity. 
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For accxirate reading, the mobility of any given 
sequencing reaction product must migrate through the 
electrophoresis gel with a speed proportional only to 
its length. Products which migrate faster or slower 
5 than normal for a given length will result in sequence 
ambiguities or errors known as "compressions". 

Anomalous migration speed can be caused by 
secondary structure of the DNA and is apparently the 
cause of most "conpression" artifacts seen in 
10 radioactive-label (and other) sequencing experiments at 
GC-rich regions. These can often be resolved by the use 
of analogs of dGTP such as 7-deaza-dGTP or dITP. 
Another compression- like artifact is observed when some 
dye- labeled ddNTPs are used for sequencing. Several 
15 examples of this can be seen in Lee, L.G., Connell, 
C.R., woo, S.L., Cheng, R.D., McArdle, B.F., Fuller, 
C.W., Halloran, N.C., and Wilson. R.K., NUcleic Acids 
Res., 20:2471-2483, 1992 (see figures 4g, 4h and 6h 
using ddCTP labeled with tetramethylbodipy and TMR or 
20 ddGTP labeled with bifluor) . These compression-like 
artifacts are produced, even in sequences which are 
compression- free when sequenced radioactively or with 
dye-labeled primers. These artifacts can sometimes be 
eliminated by substituting dITP for dGTP or alpha^thio 
25 dNTPs for normal dNTPs (Lee, L.G. et al.. Nucleic Acids 
ReB., 20: 2471-2483, 1992; U.S. Patent No. 5,187,085). 
Similar artifacts seen with the fluorescein dye-labeled 
ddNTPs sold by Applied Biosystems for dye -terminator 
sequencing with T7 DNA polymerase are resolved by 
30 substituting alpha-thio dNTPs for normal dNTPs (Lee, 
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L.G. et al.. Nucleic Acids Res., 20: 2471-2483, 1992; 
U.S. Patent No. 5,187,085). 

Prober, J.M. , Trainor, G.L-, Dam, R.J., Hobbs, 
F.W., Robertson, C.W., Zagursky, R.J., Cocuzza, A.J., 
5 Jensen, M.A. and Bauineister, K., Science 238:336-41 
(1987) performed sequencing using terminators labeled 
with siibstituted succinyl- fluoresceins with linkers of 
10 atoms in length, together with dATP, dCTP, dTTP, 7- 
deaza-dGTP and AMV reverse transcriptase, and a 
10 fluorescence-detecting instrument. From Fig. 6 of this 
paper is clear that overall band intensities varied by 
more than 10 -fold, far more than the best available 
current methods with dye primers or radioactive labels. 
Dideoxy NTP terminators that have the same basic 
15 structure as the Prober et al. (1987) terminators, but 
have four rhodamine dyes used in place of the succinyl 
fluoresceins and linkers of 5 atoms in length, have been 
used for sequencing with Taq polymerase. In order to 
use these terminators, dITP is used in place of dGTP or 
20 7-deaza-dGTP to eliminate severe "compression" 

artifacts. This method has been practiced using cloned 
Taq DNA polymerase (Bergot , WO 9105060; Parker, L.T., 
Deng, Q, Zalceri, H., Carlson, C. Nickerson, D.A., Kwok, 
P.Y., Blotechniques 19 (1) :116-121, 1995) and with a 
25 mutant of Taq polymerase (D49G, AmpliTaq CS) lacking 5'- 
3' exonuclease activity. However, band intensities vary 
by as much as 20-fold, limiting the accuracy and readr 
length possible with the method (Parker, L.T., Zakeri, 
H., Deng, Q., Spurgeon, S., Rwok, P.Y., Nickerson, D.A., 
30 Blotechniques 21(4) : 694-699, 1996) . 
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Lee, L.G., Connell, C.R.. Woo, S.L., Cheng, R.D., 
McArdle, B.F., Fuller, C.W., Hallorand, N.D. and Wilson, 
R.K., Nucleic Acids Rea.. 20:2471, 1992) describe 
sequencing with a set of ddNTP terminators and T7 DMA 

5 polymerase. All have fluorescein- type dyes attached to 
the ddNTPs in essentially the same manner as the 
rhodamine terminators used for Taq sequencing. These 
are used with modified T7 DNA polymerase (Sequenase™ 
version 2.0) and alpha-thio dUTPs. The thio dNTPs are 

10 used to resolve the "con?)ression« artifacts like dITP is 
used for the Taq dye -terminator methods. The results 
with this system are such that bands vary in intensity 

about 10-fold. 

Wayne Barnes has published a protocol for dye- 
15 terminator sequencing with FY modified polymerases and 
Mn^* (Scientech Corp. St. Louis, MO). Bands are more 
uniform with this method varying about 4.5-fold at most. 

Fluorescein- 12 ddNTPs that have a linker length of 
12 atoms and Biotin-11 ddNTPs that have a linker length 
20 of 11 atoms are available (Dupont NEN, Wilmington, DB) . 
These labeled ddNTPs are described as useful in 
sequencing reactions. 

ABI PRISM disclose dichlororhodamine dyes linked to 
terminators by propargyl/ethylene oxide/amino CEO") 
25 linkers eight atoms in length for sequencing (Rosenblum, 
B.B., Lee, L.G., Spurgeon, S.L., Khan, S.H., Menchen, 
S.M., Heiner, C.R., and Chen, S.M., Wucleic Acids Res. 
25(22) •.4500-4504, 1997). 
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Cyanine dyes have been utilized in dye terminators 
for sequencing (Lee et al.. Nucleic Acids Res., 
20(10) :2471, 1992). 

BTTMMARY O P THB INVENTIQH 

5 The present invention provides novel dideoxy dye- 

labeled terminators which are useful in a number of 
biological processes, including providing uniform band 
intensities and the resolution of dye-induced 
compression artifacts in DNA sequencing. The dideoxy 
10 dye-labeled terminators of the present invention are 
particularly well suited for use with DNA polymerases 
that are thermostable or which contain an altered dNMP 
binding site (Tabor et al., U.S. Patent No. 5,614,365). 
Use of the terminators of the present invention for 
15 sequencing does not require the use of nucleotide 
analogs such as dITP or alpha-thio nucleotides to 
eliminate dye-induced conqpress ion artifacts, ^plicant 
has sxjrprisingly found that there is a strong 
correlation between t he length of the link between the 
20 dye molecule and the nucleotide and band u niformity, but 
l ittle correlation between the type of dye (or other 
p^m«>t-.^i-B ) and band uniformit y. Dye termina tors_witil-, 
linkers of 10 or more atoms (extenc 



atoms (10, 1 vfcen used in sequencing 

25 reactions prod uce^a nds in sequencing gels of 

signif icantly <TT™^l_IZr^^°"°^^ V romnared w i th dy e 
t erminators with linkers less_ tl3an 10 atCTPS. 

The dye termininators of the present invention with 
extended linkers typically are provided in groups of 
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foiir (ATGC) with or without a thermostable DNA 
polymerase and are especially iiseful in a method of 

sequence analysis. 

In a first aspect, the invention features a kit for 
5 DNA sequencing having a first, second, third and fourth 
dye terminator molecule, each of the dye terminator 
molecules has a dye molecule, a JjjgJcer_of. at least 10 
at oms in length and either ddATP, ddCTP, ddGTP or ddTTP 
as a mono or tri -phosphate and a thermostable DNA 

10 polymerase . 

By "dye molecule* is meant any molecule that has a 
detectable emission spectrum, including but not limited 
to fluorescein, rhodamine, texas red, eosin, lissamine, 
coumarin, cyanine, and derivatives of these molecules. 
15 Dyes also include energy transfer dyes each comprising a 
donor and an acceptor dye. 

By "linker" is meant a rf iain of at least 10 atoms 
r^gmprjB-in g carbou. nitrogeu, and oxygen which linlcs th e 
dve molecule with the dideoxynucleotide . The^^ainmay^ 
20 also contain subst ituted ^rbon or sulfur . Linkage 
typically occurs at the aromatic base moiety of the 
nucleotide. The first two atoms of the linker attached 
to the base are typically joined in a triple bond. 

By "substituted carbon " is meant that one or more 
25 hydrogens are replaced with a substitute grooflp such as, 
but not limited to, hydroxyl, cyano, alkoxy, oxygen, 
sulf\ir, nitroxy, halogen, -N(CH,)a, amino, and -SH. 

By "thermostable DNA polymerase" is meant a DNA 
polymerase has a half-life of greater than 5 minutes at 
30 aO*C. Such polymerases include, but are not limited to, 
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DNA polymerases encoded by TberxmiB aquaticuB, ThenauB 
tbermophiluB, Tbezmaa flavus, ThennococcuB littxjralia, 
Pyrococcua furioaua, Therwotoga maritima, and Tberaotoffa 
neapolitana. 

5 In a preferred embodiment the thermostable DNA 

polymerase has an altered dNMP binding site so as to 
in5>rove the incorporation of dideoxynucleotides relative 
to the natural polymerase. A DNA polymerase with an 
altered dNMP binding site does not discriminate 
10 significantly between dideoxynucleotides and 

deoxynucleotides. The chance of incorporating a 
dideoxynucleotide is approximately the same as that of a 
deoxynucleotide or at least l/lO the efficiency of a 
deoxynucleotide . 

15 In a second aspect the invention features a 

compound of formula (I) 

A 

I 

B 



A is a cyanine dye of the structure 




wherein the curved lines represent carbon atoms 
necessary for the formulation of cyanine dyes; X axxd 
20 are selected from the grot^) consisting of O, S, and 



wo 99/40223 



PCTAJS99/02104 



8 

CHj-C-CHj; m is an integer selected from the group 
consisting of 1. 2, 3, and 4; Rl, R2, R3, R4, R5, R6, 
and R7 are independently selected from the group 
consisting of H, OH, CO^, sulfonic acid or sulfonate 

5 groups, esters, amides, ethers, alkyl or aryl groups^ 
and B and one Rl, R2, R3, R4, R5, R6 or R7 is B. 

B is a linker of at least 10 atoms in length 
wherein the atoms are selected from the group consisting 
of carbon, nitrogen, oxygen, substituted carbon and 

10 sulfxrc and the linker is attached at one end to A and at 
the other end to C. 

C is a dideoxynucleotide selected from the group 

consisting of: 




and wherein the linker is covalently bonded to the 
dideoxynucleotide at position 7 for the purines (ddG, 
ddA) and at position 5 for the pyrimidines (ddT, ddC) 
and wherein r is a mono or tri -phosphate. 

The term •sxU.fonic acid or sulfonate groups' refer 
to SO3H groups or salts thereof. 

The term •ester- refers to a chemical moiety with 
formula -(R)n-COOR', where R and R' are independently 
selected from the group consisting of saturated or 
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unsaturated alkyl and f ive-membered or six-membered aryl 
or heteroaryl moieties and where n is 0 or !• 

The term *amide'' refers to a chemical substituent 
of formula -NHCX)R, where R is selected from the group 
5 consisting of hydrogen, alkyl, hydroxyl, and five- 
membered or six-membered aryl or heteroaryl ring 
moieties, where the ring is optionally substituted with 
one or more substituents independently selected from the 
group consisting of alkyl, halogen, tr ihaloraethyl , 
10 carboxylate, nitro, or ester. 

The term ~ether' refers to a chemical moiety with 
formula R-O-R' where R and R' are independently selected 
from the group consisting of saturated or unsaturated 
alkyl and f ive-membered or six-membered aryl or 
15 heteroaryl moieties and where n is 0 or 1. 

The te3cm "alkyl* refers to a straight -chain or 
branched aliphatic hydrocarbon. The alkyl group is 
preferably 1 to 10 carbons, more preferably a lower 
alkyl of from 1 to 7 carbons, and roost preferably 1 to 4 
20 carbons. Typical alkyl groups include methyl, ethyl, 
propyl, isopropyl, butyl, isobutyl, tertiary butyl, 
pentyl, hexyl and the like. The alkyl group may be 
substituted and some typical alkyl substituents include 
hydroxyl, cyano, alkoxy, oxygen, soilfur, nitiroxy, 
25 halogen, -NCCHa)^, amino, and -SH. 

The term "^aryl" refers to an aromatic group which 
has at least one ring having a conjugated pi electron 
system and includes both carbocyclic aryl (e.g. phenyl) 
and heterocyclic aryl groups (e.g. pyridine) . The term 
30 "carbocyclic'' refers to a compound which contains one. or 
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more covalently closed ring structxires, and that the 
atoms forming the backbone of the ring are all carbon 
atoms. The term thus distinguishes carbocyclic from 
heterocyclic rings in which the ring backbone contains 
5 at least one atom which is different from carbon. The 
term *heteroarly* refers to an aryl groxip which contains 
at least one heterocyclic ring. 

In a preferred embodiment the linker is selected 
from thegrou p consisting of ; 

10 -CeC-CHa-NH-CO- (CH,) j-NH-CO-, 
-CEC-GHa-NH-CO- (CHa) ,-MH-SOa- # 
-CeC-CHa-HH-CO- (CHa) lo-NH-CO-, 
-CEC-CHa-HH-CO- (CHa) * 

-CeC-CHa-NH-00- (CHa) s-NH-00- (CHa) 5- # and 
15 -C=C-CHa-NH-O0- (CHa) 5-NH-CO- (GH,) 10-NH-CO-^ 



In preferred embodiments the dideoxy dye 
terminators are; a compoxind of the formula {II) : 
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The ddGTP and ddCTP con^xjunds have a linker 

5 of 10 atoms in length. The Cy-5.5 ddCTP and ddTTP 
coir?>ounds have a linker of 17 atoms in length. 

In a third aspect the invention features a 
deoxyribonucleic acid sequence containing the compound 
of formula I, II, III ,IV or V. 
LO In a preferred embodiment the invention features a 

kit for DNA sequencing comprising compounds of formula 
II, III, IV, and V. 



wo 99/40223 



PCTAJS99/02104 



14 



In a further preferred embodiments the kit further 
has a thermostable DNA polymerase; the thermostable DNA 
polymerase has an altered dNMP binding site so as to 
improve the incorporation of dideoxynucleotides relative 
5 to the natxiral polymerase. 

A^l icant has ^surpri singly found that the one 
par ameter that most strongly correlates with b^ d_ 
,^rMfr^-nni f y la th ^ 1 *>nfffh of the linker between the dye 
an d the ddNTP. . Appl icant has found that by extendi^ [_ 
10 the linker leng th between the dye an d_th e nucleotide for 
any dveiddN TP combination to at least 10 atoms, that 
band uniformity is substantially improved a nd there are 
nrt^H^^-jn ^uced compression etrtifacts. 

Thus, in a fourth aspect, the invention features a 
15 method for determining the nucleotide base sequence of a 
DNA molecule consisting of the steps of incubating a DNA 
molectile annealed with a primer molecule able to 
hybridize to the DNA molecrde in a vessel containing a 
thermostable DNA polymerase, a dye terminator with a 
20 linker of at least 10 atoms between the dye and the 

nucleotide and separating DNA products of the incubating 
reaction according to size whereby at least a part of 
the nucleotide base sequence of the DNA molecule can be 
determined. 

25 In preferred embodiments, the dye terminator is a 

compound of formula 1, 11, 111, TV or V; the 
thermostable DNA polymerase has an altered dNMP binding 
site. 



wo 99/40223 



PCTAJS99/02104 



15 

Other featxires and advcintages of the invention will 
be apparent from the following description of the 
preferred embodiments thereof, and from the claims. 

All articles, ptablications and patents cited in 
5 this application are hereby incorporated by reference, 
in their entirety. 

rn^TEP DBSC PTPTTQN OF THE FIGURES 

Pig. 1 presents DNA sequence data generated xising 
M13nipl8 containing a 115 bp SaiAI fragment from lambda 
10 inserted a the BaraHI site and Cy5.5 ddGTP, ddATP, ddTTP, 
and ddCTP dye terminators. 

Fig. 2 is a graph of band intensity variability 
(rms) vs linker length (atoms) . 

HRffrRTPTIO N OF THR PRRFKPRED EMBOPIMPWTS 
15 The following Examples are provided for further 

illustrating various aspects and embodiments of the 
present invention and are in no way intended to be 
limiting of the scope. 

Exanple 1: Syntheais nf dideoxv dve terminators 

Cy 5.5 dideoxvnuc ?l*>nflide trir^hosphates 
Dye terminators labeled with Cy5.5 were prepared 
from propcirgylaminodideoxynucleotids (Prober, J.M., 
Trainor, G.L., Dam, Hobbs, F.W., Robertson, C.W., 

Zagursky, R.J./ Cocuzza, A. J. , Jensen, M.A. and 
Baumeister, K., Science 238:336-41 (1987); U.S. Patent 
Nos. 5,242,796, 5,306,618, and 5,332,666) and *CyDye 
Fluorolink Cy5.5 mono reactive dye* product PA25501 
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(Amersham Life Science) to produce coirpoxinds II, III, 
IV, and V. In the case of ddG and ddA, the 
propargylaminonucleotide was directly reacted with the 
N-hydroxysuccinimidyl ester of the Cy5-5 dye. In the 
5 case of ddC and ddT, a longer linker was constructed by 
reacting the propargylaminonucleotide with the N- 
hydroxysuccinimidyl ester of N-trif luoroacetyl-6- 
aminocaproic acid followed by hydrolysis in aqueous 
annnonia of the trifluoroacetyl group. The resulting 
10 compound was then reacted with the N-hydroxysuccinimidyl 
ester of the CyS.S dye to give the 17 -atom linker 
between the Cy 5.5 dye and the pyrimidine base. 

In addition to Cy 5.5 dyes, those who practice the 
art would know how to identify and utilize other dyes, 
15 including other cyanine dyes, with the appropriate 
optical properties. Also, the construction and 
attachment of various linkers is well known in the art. 
Suitable reagents for linker construction include one or 
more compounds consisting of activated forms of amino- 
20 protected alkyl or aryl amino acids such as compounds of 
the formula R-NH- (CHj) „-COaR' or R-NH- (CH,) JC(CHa).-CX)jR' , 
where R is an acid- or base-labile protecting group, R' 
is a reactive ester or anhydride group, X is aryl, 0> S, 
or NH, and where n and m are 0-12. Other linkers 
25 constructed by N- or O- or S- alkylation are also 
suitable. The exact linker length, of at least 10 
atoms, for a specific dye and dideoxynucleotide 
combination can be determined empirically by monitoring 
band uniformity in DMA sequencing as described (see 
30 Example 3) . 
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Exaii?)le 2: pyf> ^.erm^nat:or cvcl p Bftm^encinq 

DNA cycle sequencing was carried out using Thermo 
Sequenase™ DNA polymerase (Amersham, Cleveland, OH) and 
Cy5.5 dideoxy dye terminators using the following cycle 
5 sequencing protocol: 

1. A master mix was prepared consisting of the 
following: 

Template DNA 5.0^*1 

lOX Reaction buffer (see below) 3,5a*1 

10 Primer, 2iM 1.0/xl 

Polymerase (see below) 2/z 

H,0 15.5/xl 

Total volume 27.0/il 



lOX Reaction Buffer: 
15 150 mM Tris HCL pH 9.5 

35mM MgCla 

Polymerase: Thermo Sequenase"" DNA polymerase, 
10U//il, 0.0017U//X1, Thermoplasma acidophilvm inorganic 
pyrophosphatase: 20nM Tris-HCl, pH 8.5, ItriM DTT, O.lmM 
20 EDTA, 0-5% Tween-20, 0.5% Nonidet P-40 and 50% glycerol. 
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2. Foiir microcentrifuge tubes were labeled and 2 fil of 
Cy5.5 labeled ddG, ddA, ddT, ddC solution was added to 
each tube. 

25:1 ddG Mix, 300 fM each of dGTP, dATP, dTTP & dCTP, 
5 12/iM Cy5.5 ddGTP 

25:1 ddA Mix, 300 fM each of dGTP, dATP, dTTP & dCTP, 

12/iM Cy5.5 ddATP 

25:1 ddT Mix, 300 fM each of dGTP, dATP, dTTP & dCTP, 
12AtM Cy5.5 ddTTP 
10 25:1 ddC Mix, 300 fM each of dGTP, dATP, dTTP & dCTP, 
12/iM Cy5.5 ddCTP 

3. Six fJl of the master mix (from step 1) was 
aliquoted to each of the 4 tubes from step 2 above. 
Cycling was carried out as follows: 95"C (30 sec), 45- 

15 55°C (30 sec) and 72'C (60 sec) for 35 cycles then 
incubate at 72°C 5-7 minutes. 

4. One fil of 8M ammonium acetate was added to each 
tube. Theii 27 nl (approximately 3 times the reaction 
volume) of chilled 100% ethanol was added. Then mixture 

20 was mixed and placed on ice for 20 minutes to 
precipitate the DNA. 

5. The mixture was centrifuged in a microcentrifuge 
(-12,000rpm) for 20-30 minutes at either room 
tenperature or 4**C. The supernatant was removed and 

25 then 200 /zl of 70% ethanol was added to wash the DNA 
pellet. 



wo 99/40223 



PCT/US99/02104 



19 

6. The mixture was again centrifuged for 5 minutes, 
the supernatant removed and the pellet dried (in a 
vacuum centrifuge) for 2-3 minutes. 

7. Each pellet was resuspended in 6 fil of formamide 
5 loading dye (Amersham, Cleveland, OH) , vortexed 

vigorously (10-20 sec) to ensure that all DNA was 
dissolved. The mixture was briefly centrifuged to 
collect the sample at the bottom of the tube. 

8. samples were heated to 70»C for 2-3 minutes to 
10 denature the DNA, then placed on ice. 

9. Then 1-5-2 fil of the volume was loaded onto a lane 
of the sequencing gel, and the gel run on the MICRO C3ene 
Blaster instrument (VGI) . 

For this sequence, the tenq?late DNA was M13n>pl8 
15 containing a 115 bp Sau3AI fragment from bacteriophage 
lambda inserted at the BamHI site (product number US 
70171 Amersham) . The primer is the -40 Forward 23-mer 
universal primer {5'-GTTTTCCCAGTCACGACGTTGTA-3') (SEQ. 
ID. NO. 1). Resxilts are shown in Figure 1. 

20 Exan?)le 3: r^^^^iat-ion of linker 1enqr,ri and band 

intensit y v^^-i ability 
Sequencing reactions were carried out as described 
in example 2 with various dye molecules linked to 
dideoxynucleotides with linkers of various lengths (see 
25 Table 1) . The labeled DNA products were then separated 
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on denaturing polyacrylamide gels and the labeled 
products were detected by fluorescence. The intensity 
of the bands is taken as the height of the peaks in a 
graph of fluorescence (in arbitraiy units) against time. 
5 Typically, systematic variations in peak heights can be 
seen in graphs of peak heights plotted sequentially. 
These systematic variations in the peak heights can be 
modeled by least-square fitting to a second-order 
polynominal function. Dividing the peak height for each 
10 band by the value of the curve- fit polynomial function 
yields a normalized band intensity for each peak. 
Variation in these band intensities can be esqiressed as 

the square root of the variance ^(nSx=^- (Ex) ^/n^) of the 

normalized peak heights, which can typically have values 
15 between 0 and 1 with more variability represented by 

higher numbers (Fuller, C.W., Comments 16 (3) : 1-8, 1989). 
This value is numerically equal to root -mean- square 
(RMS) value when 1.0 is subtracted from the normalized 
peak heights. These values are reported in Table 1 and 
20 graphed in Fig. 2. Variability of band intensities is 
sigixif icantly reduced when linkers of 10 or more atoms 
in length were used, resulting in sequence data that was 
easier to interpret accurately. 



Table 1 





Base , 


Dye* 


Linker 
Iiengttf' 


Band IRalfozmlty 
(xms) 


1 


T 


Coumarin 




0.32 


2 


G 


IiisBamine 


5« 


0.77 
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RllO 



R6G 



0.34 



0.32 



R60 



0.57 



ROX 



0.36 



TMR 



0.47 



TXR 



0*61 



Eos in 



0.40 



10 



Cy3 



10* 



0.24 



11 



CyS 



10* 



0.15 



10 



12 



CyS 



10* 



0.21 



13 



14 



Cir5.5 



CyS. 5 



10* 



10* 



0.21 



0.20 



15 



Fl 



12* 



0.16 



16 



Fl 



12* 



0.20 



15 



17 



Fl 



12« 



0.17 



le 



Fl 



12' 



0.18 



19 



R66 



12' 



0.13 



20 



R66 



12' 



0.25 



20 



21 



22 



ROX 



ROX 



12' 



12' 



0.21 



0.16 



23 



TOR 



12' 



0.26 



24 



TMR 



12' 



0.29 



25 



TMR 



12' 



0.37 



26 



16« 



0.32 



25 



27 



TXR 



16» 



0.24 



28 



T3CR 



163 



0.22 



29 



16» 



0.24 



30 



Cy3-Cy5 



17^ 



0.11 



31 



Cy3-Cy5 



171 



0.16 



30 



32 



Cy3-Cy5 



17* 



0.22 
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5 



10 



33 


T 


Cy3-C>r5 


17i 


0.11 


34 


C 


Cy5 


17^ 


0.14 


35 


T 


Cy5 


17i 


0.10 


36 


C 


Cy5.5 


17^ 


0.20 


37 


T 


Cy5-5 


171 


0.18 


38 


A 


Fl 


17'* 


0.16 


39 


C 


Fl 


17** 


0.24 


40 


G 


Fl 


17b 


0.18 


41 


T 


Fl 


17»» 


0.25 


42 


T 


Fl 


24*^ 


0.24 



• Abbreviations for dyes: Fl, Carbox>^luorescein; RllO, Rhodamine 
110; R6G, Rhodandne 6G; ROX, Rhodaraine X; TMR, tetramethylrhodamine ? 
TXR, Texas Red (Molecular Probes). The dyes Cy3, Cy3.S, Cy5 and 
CyS.B were from Amersham Life Science, Cleveland, OH. 



15 >» Linker length is the number of atoms between the ring structure 
the nucleoside base (A, C, G or T) and the ring structure of the 
dye. 

Linker structures 
c -CsC-CHj-NH-CX)- 
20 -CeC-CHj-NH-SOa- 

« -CfiC-GHa-UH-CS-HH- 
« -CEC-CHa-HH-CX)- (CHj) j-HH-CO- 
fl -CHC-CHa-HH-CX)- (CHa) ,-KH-SOa- 
^ -CbC-CHj-MH-OO- (CHa) ig-NH-CX)- 

25 ' -CEC-CH3-NH-CO-(CHa)5- 

1 -CEC-CHa-»H-CO- (CHa) g-NH-CO- (CHa) s" 

-CBC-CHa-KH-00-(CHa)8-»H-CO-{CHa)io-NH-CO- 



wo 99/40223 



23 



PCT/US99/02104 



CLAIMS 

1. A kit for DNA sequencing comprising: 

a first, second, third and fourth dye terminator 
molecule, each of the dye terminator molecules cotnprising 
5 a dye molecule, a linker of at least 10 atoms in length 
and either ddATP, ddCTP, ddGTP or ddTTP as a mono or tri- 
phosphate and a thermostable DNA polymerase. 

2. . The kit of claim 1, wherein said polymerase is 
a thermostable DNA polymerase that has an altered dNMP 

10 binding site so as to improve the incorporation pt 
dideoxynucleotides relative to the natural polymerase. 

3 . A coitqpound of formula (I) : 

A 

1 

B 



15 wherein A is a cyanine dye of the structure 

and the curved lines represent carbon atoms necessary for 
the formulation of cyanine dyes, X and Y are selected from 
the group consisting of 0, S, and 



wo 99/40223 



24 



PCT/US99/02104 



CH3-C-CH3, m is an integer selected from the group 
consisting of 1, 2, 3, and 4, Rl, R2, R3, R4, R5, R6 and 
R7 are independently selected from the group consisting of 
H, OH, COaH, sulfonic acid or sulfonate groups, esters, 

5 amides, ethers, alkyl or aryl groups and B, and one Rl, 
R2, R3, R4, R5, R6 or R7 is B ; 

B is a linker of at least 10 atoms in length wherein 
the atoms are selected from the group consisting of 
carbon, nitrogen, oxygen, siabstituted carbon, and sulfur 

10 and the linker is attached at one end to A and at the 
other end to G; and 

C is a dideoxynucleotide selected from the group 

consisting of 




wherein said linker is covalently bonded to said 
15 dideoxynucleotide at position 7 for ddA and ddG and at 
position 5 for ddC and ddT and wherein r is a mono or tri- 
phosphaite . 
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4. The con^KDxand of claims 3, wherein said linker is 
selected from the group consisting of 

-CeC-CHa-NH-CO- (CHi) s-NH-OO-, 
-CBC-CHa-NH-OO- (CHa) ,-NH-SOa- , 
5 -CBC-CHa-NH-OO- (CHa) io-»H-00- , 
-CEC-CHa-NH-OO- (CHa) , 

-CEC-CHa"NH-CO- (CHa) r-NH-CO- (CHj) 5- i and 
-CEC-CHa-NH-OO- (CHj) 5-NH-CO- (CH,) i«"NH-CO- . 



5. A compoxond of the formula (II) : 
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8 . A conpound of the formula (V) : 




9. A deoxyribonucleic acid sequence containing the 
5 compound of formula I. 

10. A deoxyribonucleic acid sequence containing the 
compound of formula II, III# rv, or V. 

11. A kit for DNA sequencing comprising conpounds of 
10 formula II, III, IV, and V. 
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12. The kit of claim 11, further comprising a 
thermostable DNA polymerase. 

13. The kit of claim 12, wherein said polymerase is 
a thermostable DNA polymerase that has an altered dNMP 

5 binding site so as to improve the incorporation of 
dideoxynucleotides relative to the natiiral polymerase. 

14. Method for determining the nucleotide base 
sequence of a DNA molecule comprising the steps of: 

incubating a DNA molecule annealed with a primer 
10 molecule able to hybridize to said DNA molecule in a 

vessel containing a thermostable DNA polymerase , one 
of a set of four dye terminators with an linker of at 
least 10 atoms between the dye and the nucleotide and 
separating DNA products of the inciibating 
15 reaction according to size whereby at least a part of 

the nucleotide base sequence of said DNA molecule can 
be determined. 

15. Method for determining the nucleotide base 
sequence of a DNA molecule comprising the steps of: 
20 incubating a DtC^ molecule annealed with a primer 

molecxile able to hybridize to said DNA molecixle in a 

vessel containing a thermostable DNA polymerase, a 

conpoiind of formula I eind 

separating DNA products of the incubating 
25 reaction accoixiing to size whereby at least a part of 

the nucleotide base sequence of said DNA molectile can 

be determined. 
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16 • Method for determining the nucleotide base 
sequence of a DNA molecule comprising the steps of: 

incubating a DNA molecule aimealed with a primer 
molecule able to hybridize to said DNA molecule in a 
5 vessel containing a thermostable DNA, a compound of 

formula II, III, IV, or V and 

separating DNA products of the incubating 
reaction according to size whereby at least a part of 
the nucleotide base sequence of said DNA molecxile can 
10 be determined. 

17. The method of any of claims 14, 15, or 16 
wherein said polymerase is a thermostable DNA polymerase 
that has an altered dNMP binding site so as to improve the 
incorporation of dideoxynucleotides relative to the 
15 natural polymerase. 



wo 99/40223 PCTAJS99A>2104 

1/2 




FIGURE 2 



wo 99/40223 



1/2 



PCT/US99/02104 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(I) APPLICANT: 



Kumar, Shiv 
Nanpalli, Stayam 
McArdle, Bernard F. 
Fuller, Carl W. 



(ii) TITLE OF INVENTION: 



DIDEOXY DYE TERMINATORS 



(iii) NUMBER OF SEQUENCES: 



(iv) CORRESPONDENCE ADDRESS: 



(A) 
(B) 

(C) 
(D) 
(E) 
(F) 



ADDRESSEE: 
STREET: 

CITY: 
STATE: 
COUNTRY: 
ZIP: 



Lyon & Lyon 

633 West Fifth Street 

Suite 4700 

Los Angeles 

California 

U.S.A. 

90071-2066 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 



3.5" Diskette, 1.44 Mb 
storage 

IBM Compatible 
IBM P.C. DOS 5.0 
FastSEQ for Windows 2.0 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
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(A) NAME: 

(B) REGISTRATION NUMB ER: 

(C) REFERENCE/DOCKET NUMBER: 



Warb\irg, Richard J- 
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Other embodiments are within the following claims. 



INTERNATIONAL SEARCH REPORT 


International application No. 
PCT/US99y02l04 


A. CLASSIFICATION OF SUBJECT MATTER 
IPC(6) :Please See Extra Sheet 

US CL : 435/6. 91^. 194; 530/350; 536/22.1. 23.1, 26.1 
«. _a^.. a; t piMtM* PlsMificAtian HPC^ or ta both nation ftl ctasnficatim and IPC 


a FIELDS SEARCHED 


Mtnimnm doCTimiwf f!"" seafched (diasificattoa lystein followed by cUssificatkm symbols) 
U.S. : 435/6, 9U, 194; 530/350; 53d«2.1. 23.1, 26.1 


Doconientslioii teaiebed odier tfaan mtnimom docmncotatioii to the exteat that such documeats m bduded in the fields searched 


Eleetnmte data base eoosnhed during the mtefnatioQal seaich (nai 
Pleaae See Extn Sheet 


Be of data base and. 


where practicable, 


search terms used) 


e DOCUMENTS CCH^SIDERED TO BE RELEVANT 




Category* 


Citation of documeat, with mdicatioo, where appropriate, of the r^vaat passages 


Relevant to claim No. 


X 
Y 


EVANGEUSTA ct al. Characterization of fluorescent nucleoside 
triphosphates by c^illary electrophoresis wifli laser-induced 
fluorescence detection: Actim of alkaline phosphatase and DNA 
polymerase. Analytical Biochemistry. 1996, VoL 235, pagps 89-97, 
see »tire document 


3,9 

1,2,4-8,10-17 


Y 


US 5,614,365" A (TABOR et al) 25 March 1997, see entire 
document 


1-17 


Y,P 


US 5,795,762 A (ABRAMSON et al) 18 August 1998, see entire 
document 


1-17 


[3c| FnitherdoemMotsaieHstedmtfaecoatnttatkMiofBoxC. Q See patent &aitly annex. 


tD W of iiirtiiilir nSmwaam 


T* Mu iliiiiii 
ditoantaot 
4m pmnplo 


■ oo^kiwiiidtoappaMtwa fritoitoJto iiiJiiifc^ 
or ihooiy vkdi^fna ■nmiias 


-B- — fcr doMl publ»iil co or «fty liw irtwolkinri filiat ikto 
dl^ to ii^MMi ft^ pubBotioo d1> of wuhg oitrtioD or odMr 


-jk. aam^mmtM «■ fiwiMH ■■■■ i hi i» w 

wfaM fho dooMMt it tato ftloo* 


nd to nFoira mi inwittM ilip 


«0> JuiiMil nbnwm to — oral di»do<MW. mt^ OKhifcirtm or olhir 
»p- Jufiil iwWawil y^to Iho ■nnntiwil iSIm ditobot Irtir <im 


uuMiiinil to mnif m mMuli** ttmp wbim JotMil m 
cembmrni wife ooo or wow ctfwr — eh Jowti^ mteb oCMhi—linn 
bMi« obnaw to k pma tkffldd iD tha «t 


Date of the actual Gompletioii ctf the tntemational search 
26 APRIL 1999 


Dale of masltng of the tntematioiial search report 

7 2 MAY 1999 


Naine and mailtag address of the ISA/US 
CMBMioaer flf Pateals sad Tkadcnuefcs 

Box per 

WMhngloiwD.C 20231 
FaesimUe No. (703) 305-3230 


Attthorixed ofiBcer 

JEFFREY FREDMAN 
Telephone No. (703) 308-0 196 ^ 


JOVGEmOGBB 

^ GNBSGMLMinn 



Form PCT/ISA/ZIO (second sheetXJuly 1992)^ 



INTERNATIONAL SEARCH REPORT 



InteroaticMiat applicatioB No. 
PCTAJS99/02104 



C (Continu.lioB). DOCUMENTS CONSIDERED TO BE RELEVANT 



Cat^oiy* 



Citaticm of document, with tndtcatioii. where appropiuUe. of the rdevut passages 



Rdevant to claim No. 



HARALAMBIDIS et al. Preparation of base-modified nucleosides 
suitable for non-radioactive label attadiment and dieir 
incorporation into synthetic oligodeoxyribonucleotides. Nucldc 
Acids Research. 1987, Vol 15. No. 12, pages 4857-4876, see 
entire document 

VANDER HORN et al. ThOTno sequoiase™ DNA polymerase and 
T. addophilum pyrojAo^hatase: New thermostable en^mes for 
DNA sequencing. Biotecfaniques. 1997, Vol 22. No. 4, pages 758- 
765, see mtire docummt 



1-17 



1-17 



Form PCTnSA/210 (cootiDiiatioa of accood sheetXJuly 1992>* 



INTERNATIONAL SEARCH REPORT 



InternatioDftl application No. 
PCT/US99/D2104 



A. CLASSIFICATION OF SUBJECT MATTER: 

IPC (6): C12Q 1/68; CUP 19/34; C12N 1/12; C07K 13A)0; C07H 21/02, 21/04, 19/04 

B. HELDS SEARCHED 

Elednmic data basea cooiulted (Name of data base aad whera practicable tonna used): 
APS, CAPLUS, REGISTRY, MEDUNE, BIOSIS 

seafch temr DNA, tmcleic, oligo, polynucleotide, primer, probe, sequeadng, polymerizing, potymemae, tfierasostabie, 
adjust, alter, mulatkm, aucleolide, nudeoiide, fluorescent, linker 

BOX n. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as firflows: 

This applicaliott contains the following inveatiaas or groups of bventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1. 

Oroap I, claim(s)l-13. drawn to cyaaine dye nucleotides for DNA sof^eoobg and kits GOBtaming them. 



Group n, cUim(s) 14-17, drawn to methods of DNA sequeoctng. 



The inventions listed as Groups I and II do not relate to a single mventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: The daims 
lack a special technical feature becmise the Itoldag Umitatioo between the two daims is a dye with a linker of at least 10 
atoms between the dye and the nucleotide base. Nucleotides for use in DNA sequenctag with this timitation are 
expressly taught by Evaagelista et al CAnal. Biochem. (1996) 23S:S9-97). Therefore, there is no feature which 
distinguishes the daims over the prior art and oonsequeatly ao special technical feature. 



Fonn PCT/ISA/210 (extra sheetXJuty 1992)* 



INTERNATIONAL SEARCH REPORT 



latematioaal application No. 
PCT/US99/&2104 



Box I ObservatKMis where certain daima were fomd niiaearchable (Cootinttalloa of lUm 1 of first sheet) 



This tatematiooal report has not been established in lespea of oeitain cUtms under Aiticle HCZXil) &r the fi^lowmg reasons: 
Claims Nos.: 

because ihey relate to nibject matter not required to be searched by this Authority, namely: 



j I Claims Nos.: 



2. ClahnsNos.: 



because they relate to parta of the inleiiiatioaal appKc^kn that do not comply with the prescribed requtremcots to such 
an eodeat that no meantngfiil mlDnialioMU seaich can be canied out speci6caUy: 



3. Q Claims Nos.: 
beoaasetfaey 



are dqxadcat claims and are not dnfled in aoooidaBce with tiie seoood and thiid s enten ces of Rule 6j4(a)L 



Bos II OhservaHoDS where unity of Invcntloa Is lachinf (ConthmatloB of Item 2 of first sheet) 



This Intemattooal Searehtng Authority fiMod moitipie mventiaes ta this tntematioaal appticatkm, as fibttowK 
Please See Extia Sheet 



Q Aaatt required additiottdseareh fees were tinrely paid by the appKcaat^thbi^ 



2. As all searehable claima could be searched without eflfort justing an addttional fee, this Authority did not invite payment 
of any additioiial fee. 

3. (~| As only some of the reqatred additiooal seareh fees were timely paid by the afyticant, this intematiooal seareh report 
^""^ only Ihoae daimt for which fees were paid, specifically daims Nos^: 



4. rn No reipured additional seareh fees were timely paid by the applicant Conaequently, this international seareh report j 
restricted to the invention fifst mentioned in the claims; it is covered by claims Nos.: 



Remnrh om Protest The addi t ional search fees were accompanied by the appticanlTs protest 

I } No piotest a c companied tfie payment of additional seaidi fees. 



Pom PCT/ISA/ZIO (cQothmation of first shee<l)XJuly 1992)* 



